Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 145
Filtrar
1.
medRxiv ; 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38645167

RESUMO

Apart from ancestry, personal or environmental covariates may contribute to differences in polygenic score (PGS) performance. We analyzed effects of covariate stratification and interaction on body mass index (BMI) PGS (PGSBMI) across four cohorts of European (N=491,111) and African (N=21,612) ancestry. Stratifying on binary covariates and quintiles for continuous covariates, 18/62 covariates had significant and replicable R2 differences among strata. Covariates with the largest differences included age, sex, blood lipids, physical activity, and alcohol consumption, with R2 being nearly double between best and worst performing quintiles for certain covariates. 28 covariates had significant PGSBMI-covariate interaction effects, modifying PGSBMI effects by nearly 20% per standard deviation change. We observed overlap between covariates that had significant R2 differences among strata and interaction effects - across all covariates, their main effects on BMI were correlated with their maximum R2 differences and interaction effects (0.56 and 0.58, respectively), suggesting high-PGSBMI individuals have highest R2 and increase in PGS effect. Using quantile regression, we show the effect of PGSBMI increases as BMI itself increases, and that these differences in effects are directly related to differences in R2 when stratifying by different covariates. Given significant and replicable evidence for context-specific PGSBMI performance and effects, we investigated ways to increase model performance taking into account non-linear effects. Machine learning models (neural networks) increased relative model R2 (mean 23%) across datasets. Finally, creating PGSBMI directly from GxAge GWAS effects increased relative R2 by 7.8%. These results demonstrate that certain covariates, especially those most associated with BMI, significantly affect both PGSBMI performance and effects across diverse cohorts and ancestries, and we provide avenues to improve model performance that consider these effects.

2.
Nat Commun ; 15(1): 3384, 2024 Apr 22.
Artigo em Inglês | MEDLINE | ID: mdl-38649760

RESUMO

Polygenic variation unrelated to disease contributes to interindividual variation in baseline white blood cell (WBC) counts, but its clinical significance is uncharacterized. We investigated the clinical consequences of a genetic predisposition toward lower WBC counts among 89,559 biobank participants from tertiary care centers using a polygenic score for WBC count (PGSWBC) comprising single nucleotide polymorphisms not associated with disease. A predisposition to lower WBC counts was associated with a decreased risk of identifying pathology on a bone marrow biopsy performed for a low WBC count (odds-ratio = 0.55 per standard deviation increase in PGSWBC [95%CI, 0.30-0.94], p = 0.04), an increased risk of leukopenia (a low WBC count) when treated with a chemotherapeutic (n = 1724, hazard ratio [HR] = 0.78 [0.69-0.88], p = 4.0 × 10-5) or immunosuppressant (n = 354, HR = 0.61 [0.38-0.99], p = 0.04). A predisposition to benign lower WBC counts was associated with an increased risk of discontinuing azathioprine treatment (n = 1,466, HR = 0.62 [0.44-0.87], p = 0.006). Collectively, these findings suggest that there are genetically predisposed individuals who are susceptible to escalations or alterations in clinical care that may be harmful or of little benefit.


Assuntos
Predisposição Genética para Doença , Leucopenia , Herança Multifatorial , Polimorfismo de Nucleotídeo Único , Humanos , Contagem de Leucócitos , Masculino , Feminino , Leucopenia/genética , Leucopenia/sangue , Pessoa de Meia-Idade , Idoso , Adulto , Imunossupressores/uso terapêutico
3.
Artigo em Inglês | MEDLINE | ID: mdl-38613820

RESUMO

OBJECTIVES: Phenotyping is a core task in observational health research utilizing electronic health records (EHRs). Developing an accurate algorithm demands substantial input from domain experts, involving extensive literature review and evidence synthesis. This burdensome process limits scalability and delays knowledge discovery. We investigate the potential for leveraging large language models (LLMs) to enhance the efficiency of EHR phenotyping by generating high-quality algorithm drafts. MATERIALS AND METHODS: We prompted four LLMs-GPT-4 and GPT-3.5 of ChatGPT, Claude 2, and Bard-in October 2023, asking them to generate executable phenotyping algorithms in the form of SQL queries adhering to a common data model (CDM) for three phenotypes (ie, type 2 diabetes mellitus, dementia, and hypothyroidism). Three phenotyping experts evaluated the returned algorithms across several critical metrics. We further implemented the top-rated algorithms and compared them against clinician-validated phenotyping algorithms from the Electronic Medical Records and Genomics (eMERGE) network. RESULTS: GPT-4 and GPT-3.5 exhibited significantly higher overall expert evaluation scores in instruction following, algorithmic logic, and SQL executability, when compared to Claude 2 and Bard. Although GPT-4 and GPT-3.5 effectively identified relevant clinical concepts, they exhibited immature capability in organizing phenotyping criteria with the proper logic, leading to phenotyping algorithms that were either excessively restrictive (with low recall) or overly broad (with low positive predictive values). CONCLUSION: GPT versions 3.5 and 4 are capable of drafting phenotyping algorithms by identifying relevant clinical criteria aligned with a CDM. However, expertise in informatics and clinical experience is still required to assess and further refine generated algorithms.

4.
medRxiv ; 2024 Mar 13.
Artigo em Inglês | MEDLINE | ID: mdl-38559137

RESUMO

Hypertriglyceridemia (HTG) is a common cardiovascular risk factor characterized by elevated circulating triglyceride (TG) levels. Researchers have assessed the genetic factors that influence HTG in studies focused predominantly on individuals of European ancestry (EA). However, relatively little is known about the contribution of genetic variation to HTG in people of AA, potentially constraining research and treatment opportunities; the lipid profile for African ancestry (AA) populations differs from that of EA populations-which may be partially attributable to genetics. Our objective was to characterize genetic profiles among individuals of AA with mild-to-moderate HTG and severe HTG versus those with normal TGs by leveraging whole genome sequencing (WGS) data and longitudinal electronic health records (EHRs) available in the All of Us (AoU) program. We compared the enrichment of functional variants within five canonical TG metabolism genes, an AA-specific polygenic risk score for TGs, and frequencies of 145 known potentially causal TG variants between patients with HTG and normal TG among a cohort of AA patients (N=15,373). Those with mild-to-moderate HTG (N=342) and severe HTG (N≤20) were more likely to carry APOA5 p.S19W (OR=2.21, 95% CI [1.70-2.88], p=2.74×10 -9 and OR=4.16, 95% CI [1.39-12.45], p=0.01, respectively) than those with normal TG. They were also more likely to have an elevated (top 10%) PRS, elevated carriage of potentially causal variant alleles, and carry any genetic risk factor. Alternative definitions of HTG yielded comparable results. In conclusion, individuals of AA with HTG were enriched for genetic risk factors compared to individuals with normal TGs.

5.
BMC Med Inform Decis Mak ; 22(Suppl 2): 348, 2024 Mar 03.
Artigo em Inglês | MEDLINE | ID: mdl-38433189

RESUMO

BACKGROUND: Systemic lupus erythematosus (SLE) is a rare autoimmune disorder characterized by an unpredictable course of flares and remission with diverse manifestations. Lupus nephritis, one of the major disease manifestations of SLE for organ damage and mortality, is a key component of lupus classification criteria. Accurately identifying lupus nephritis in electronic health records (EHRs) would therefore benefit large cohort observational studies and clinical trials where characterization of the patient population is critical for recruitment, study design, and analysis. Lupus nephritis can be recognized through procedure codes and structured data, such as laboratory tests. However, other critical information documenting lupus nephritis, such as histologic reports from kidney biopsies and prior medical history narratives, require sophisticated text processing to mine information from pathology reports and clinical notes. In this study, we developed algorithms to identify lupus nephritis with and without natural language processing (NLP) using EHR data from the Northwestern Medicine Enterprise Data Warehouse (NMEDW). METHODS: We developed five algorithms: a rule-based algorithm using only structured data (baseline algorithm) and four algorithms using different NLP models. The first NLP model applied simple regular expression for keywords search combined with structured data. The other three NLP models were based on regularized logistic regression and used different sets of features including positive mention of concept unique identifiers (CUIs), number of appearances of CUIs, and a mixture of three components (i.e. a curated list of CUIs, regular expression concepts, structured data) respectively. The baseline algorithm and the best performing NLP algorithm were externally validated on a dataset from Vanderbilt University Medical Center (VUMC). RESULTS: Our best performing NLP model incorporated features from both structured data, regular expression concepts, and mapped concept unique identifiers (CUIs) and showed improved F measure in both the NMEDW (0.41 vs 0.79) and VUMC (0.52 vs 0.93) datasets compared to the baseline lupus nephritis algorithm. CONCLUSION: Our NLP MetaMap mixed model improved the F-measure greatly compared to the structured data only algorithm in both internal and external validation datasets. The NLP algorithms can serve as powerful tools to accurately identify lupus nephritis phenotype in EHR for clinical research and better targeted therapies.


Assuntos
Lúpus Eritematoso Sistêmico , Nefrite Lúpica , Humanos , Nefrite Lúpica/diagnóstico , Registros Eletrônicos de Saúde , Processamento de Linguagem Natural , Fenótipo , Doenças Raras
6.
NPJ Digit Med ; 7(1): 46, 2024 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-38409350

RESUMO

Drug repurposing represents an attractive alternative to the costly and time-consuming process of new drug development, particularly for serious, widespread conditions with limited effective treatments, such as Alzheimer's disease (AD). Emerging generative artificial intelligence (GAI) technologies like ChatGPT offer the promise of expediting the review and summary of scientific knowledge. To examine the feasibility of using GAI for identifying drug repurposing candidates, we iteratively tasked ChatGPT with proposing the twenty most promising drugs for repurposing in AD, and tested the top ten for risk of incident AD in exposed and unexposed individuals over age 65 in two large clinical datasets: (1) Vanderbilt University Medical Center and (2) the All of Us Research Program. Among the candidates suggested by ChatGPT, metformin, simvastatin, and losartan were associated with lower AD risk in meta-analysis. These findings suggest GAI technologies can assimilate scientific insights from an extensive Internet-based search space, helping to prioritize drug repurposing candidates and facilitate the treatment of diseases.

7.
medRxiv ; 2024 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-38196578

RESUMO

Objectives: Phenotyping is a core task in observational health research utilizing electronic health records (EHRs). Developing an accurate algorithm demands substantial input from domain experts, involving extensive literature review and evidence synthesis. This burdensome process limits scalability and delays knowledge discovery. We investigate the potential for leveraging large language models (LLMs) to enhance the efficiency of EHR phenotyping by generating high-quality algorithm drafts. Materials and Methods: We prompted four LLMs-GPT-4 and GPT-3.5 of ChatGPT, Claude 2, and Bard-in October 2023, asking them to generate executable phenotyping algorithms in the form of SQL queries adhering to a common data model (CDM) for three phenotypes (i.e., type 2 diabetes mellitus, dementia, and hypothyroidism). Three phenotyping experts evaluated the returned algorithms across several critical metrics. We further implemented the top-rated algorithms and compared them against clinician-validated phenotyping algorithms from the Electronic Medical Records and Genomics (eMERGE) network. Results: GPT-4 and GPT-3.5 exhibited significantly higher overall expert evaluation scores in instruction following, algorithmic logic, and SQL executability, when compared to Claude 2 and Bard. Although GPT-4 and GPT-3.5 effectively identified relevant clinical concepts, they exhibited immature capability in organizing phenotyping criteria with the proper logic, leading to phenotyping algorithms that were either excessively restrictive (with low recall) or overly broad (with low positive predictive values). Conclusion: GPT versions 3.5 and 4 are capable of drafting phenotyping algorithms by identifying relevant clinical criteria aligned with a CDM. However, expertise in informatics and clinical experience is still required to assess and further refine generated algorithms.

8.
J Am Med Inform Assoc ; 31(4): 1036-1041, 2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38269642

RESUMO

INTRODUCTION: Phenotyping algorithms enable the interpretation of complex health data and definition of clinically relevant phenotypes; they have become crucial in biomedical research. However, the lack of standardization and transparency inhibits the cross-comparison of findings among different studies, limits large scale meta-analyses, confuses the research community, and prevents the reuse of algorithms, which results in duplication of efforts and the waste of valuable resources. RECOMMENDATIONS: Here, we propose five independent fundamental dimensions of phenotyping algorithms-complexity, performance, efficiency, implementability, and maintenance-through which researchers can describe, measure, and deploy any algorithms efficiently and effectively. These dimensions must be considered in the context of explicit use cases and transparent methods to ensure that they do not reflect unexpected biases or exacerbate inequities.


Assuntos
Pesquisa Biomédica , Registros Eletrônicos de Saúde , Algoritmos , Fenótipo , Padrões de Referência
9.
medRxiv ; 2024 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-38260403

RESUMO

Genome-wide association studies (GWAS) have been instrumental in identifying genetic associations for various diseases and traits. However, uncovering genetic underpinnings among traits beyond univariate phenotype associations remains a challenge. Multi-phenotype associations (MPA), or genetic pleiotropy, offer important insights into shared genes and pathways among traits, enhancing our understanding of genetic architectures of complex diseases. GWAS of biobank-linked electronic health record (EHR) data are increasingly being utilized to identify MPA among various traits and diseases. However, methodologies that can efficiently take advantage of distributed EHR to detect MPA are still lacking. Here, we introduce mixWAS, a novel algorithm that efficiently and losslessly integrates multiple EHRs via summary statistics, allowing the detection of MPA among mixed phenotypes while accounting for heterogeneities across EHRs. Simulations demonstrate that mixWAS outperforms the widely used MPA detection method, Phenome-wide association study (PheWAS), across diverse scenarios. Applying mixWAS to data from seven EHRs in the US, we identified 4,534 MPA among blood lipids, BMI, and circulatory diseases. Validation in an independent EHR data from UK confirmed 97.7% of the associations. mixWAS fundamentally improves the detection of MPA and is available as a free, open-source software.

10.
J Am Med Inform Assoc ; 31(2): 386-395, 2024 Jan 18.
Artigo em Inglês | MEDLINE | ID: mdl-38041473

RESUMO

OBJECTIVE: Pediatric patients have different diseases and outcomes than adults; however, existing phecodes do not capture the distinctive pediatric spectrum of disease. We aim to develop specialized pediatric phecodes (Peds-Phecodes) to enable efficient, large-scale phenotypic analyses of pediatric patients. MATERIALS AND METHODS: We adopted a hybrid data- and knowledge-driven approach leveraging electronic health records (EHRs) and genetic data from Vanderbilt University Medical Center to modify the most recent version of phecodes to better capture pediatric phenotypes. First, we compared the prevalence of patient diagnoses in pediatric and adult populations to identify disease phenotypes differentially affecting children and adults. We then used clinical domain knowledge to remove phecodes representing phenotypes unlikely to affect pediatric patients and create new phecodes for phenotypes relevant to the pediatric population. We further compared phenome-wide association study (PheWAS) outcomes replicating known pediatric genotype-phenotype associations between Peds-Phecodes and phecodes. RESULTS: The Peds-Phecodes aggregate 15 533 ICD-9-CM codes and 82 949 ICD-10-CM codes into 2051 distinct phecodes. Peds-Phecodes replicated more known pediatric genotype-phenotype associations than phecodes (248 vs 192 out of 687 SNPs, P < .001). DISCUSSION: We introduce Peds-Phecodes, a high-throughput EHR phenotyping tool tailored for use in pediatric populations. We successfully validated the Peds-Phecodes using genetic replication studies. Our findings also reveal the potential use of Peds-Phecodes in detecting novel genotype-phenotype associations for pediatric conditions. We expect that Peds-Phecodes will facilitate large-scale phenomic and genomic analyses in pediatric populations. CONCLUSION: Peds-Phecodes capture higher-quality pediatric phenotypes and deliver superior PheWAS outcomes compared to phecodes.


Assuntos
Registros Eletrônicos de Saúde , Estudo de Associação Genômica Ampla , Criança , Humanos , Estudos de Associação Genética , Genômica , Fenótipo , Polimorfismo de Nucleotídeo Único
11.
Nat Commun ; 14(1): 7836, 2023 Nov 30.
Artigo em Inglês | MEDLINE | ID: mdl-38036523

RESUMO

African Americans have a significantly higher risk of developing chronic kidney disease, especially focal segmental glomerulosclerosis -, than European Americans. Two coding variants (G1 and G2) in the APOL1 gene play a major role in this disparity. While 13% of African Americans carry the high-risk recessive genotypes, only a fraction of these individuals develops FSGS or kidney failure, indicating the involvement of additional disease modifiers. Here, we show that the presence of the APOL1 p.N264K missense variant, when co-inherited with the G2 APOL1 risk allele, substantially reduces the penetrance of the G1G2 and G2G2 high-risk genotypes by rendering these genotypes low-risk. These results align with prior functional evidence showing that the p.N264K variant reduces the toxicity of the APOL1 high-risk alleles. These findings have important implications for our understanding of the mechanisms of APOL1-associated nephropathy, as well as for the clinical management of individuals with high-risk genotypes that include the G2 allele.


Assuntos
Glomerulosclerose Segmentar e Focal , Humanos , Glomerulosclerose Segmentar e Focal/genética , Apolipoproteína L1/genética , Predisposição Genética para Doença , Fatores de Risco , Genótipo , Apolipoproteínas/genética
12.
Elife ; 122023 10 26.
Artigo em Inglês | MEDLINE | ID: mdl-37882666

RESUMO

Background: Two risk variants in the apolipoprotein L1 gene (APOL1) have been associated with increased susceptibility to sepsis in Black patients. However, it remains unclear whether APOL1 high-risk genotypes are associated with occurrence of either sepsis or sepsis-related phenotypes in patients hospitalized with infections, independent of their association with pre-existing severe renal disease. Methods: A retrospective cohort study of 2242 Black patients hospitalized with infections. We assessed whether carriage of APOL1 high-risk genotypes was associated with the risk of sepsis and sepsis-related phenotypes in patients hospitalized with infections. The primary outcome was sepsis; secondary outcomes were short-term mortality, and organ failure related to sepsis. Results: Of 2242 Black patients hospitalized with infections, 565 developed sepsis. Patients with high-risk APOL1 genotypes had a significantly increased risk of sepsis (odds ratio [OR]=1.29 [95% CI, 1.00-1.67; p=0.047]); however, this association was not significant after adjustment for pre-existing severe renal disease (OR = 1.14 [95% CI, 0.88-1.48; p=0.33]), nor after exclusion of those patients with pre-existing severe renal disease (OR = 0.99 [95% CI, 0.70-1.39; p=0.95]). APOL1 high-risk genotypes were significantly associated with the renal dysfunction component of the Sepsis-3 criteria (OR = 1.64 [95% CI, 1.21-2.22; p=0.001]), but not with other sepsis-related organ dysfunction or short-term mortality. The association between high-risk APOL1 genotypes and sepsis-related renal dysfunction was markedly attenuated by adjusting for pre-existing severe renal disease (OR = 1.36 [95% CI, 1.00-1.86; p=0.05]) and was nullified after exclusion of patients with pre-existing severe renal disease (OR = 1.16 [95% CI, 0.74-1.81; p=0.52]). Conclusions: APOL1 high-risk genotypes were associated with an increased risk of sepsis; however, this increased risk was attributable predominantly to pre-existing severe renal disease. Funding: This study was supported by R01GM120523 (QF), R01HL163854 (QF), R35GM131770 (CMS), HL133786 (WQW), and Vanderbilt Faculty Research Scholar Fund (QF). The dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center's BioVU which is supported by institutional funding, the 1S10RR025141-01 instrumentation award, and by the CTSA grant UL1TR0004from NCATS/NIH. Additional funding provided by the NIH through grants P50GM115305 and U19HL065962. The authors wish to acknowledge the expert technical support of the VANTAGE and VANGARD core facilities, supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA068485) and Vanderbilt Vision Center (P30 EY08126). The funders had no role in design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.


When the body is fighting off an infection, the processes it uses to protect itself can sometimes overreact. This results in a condition known as sepsis which can cause life-threatening damage to multiple organs. In the United States, Black patients are 60-80% more likely to develop sepsis compared to individuals who identify as White; differences remain even after accounting for socio-economic status and presence of other illnesses. Recent work has suggested that two variants of the APOL1 gene which are almost exclusively found in people with African ancestry may be a contributing factor to this disparity. These 'high-risk' genetic variants have also been shown to increase the likelihood of kidney diseases. It is therefore possible that the elevated chance of sepsis is not directly linked to these variations of APOL1, but rather is the result of patients already having reduced kidney function. To understand the relationship between APOL1 and sepsis, Jiang et al. analyzed data from patients admitted to Vanderbilt University Medical Centre in the United States between 2000 and 2020. This included 2,242 patients who identified as Black and had been hospitalized with an infection. The analyses showed that 16% of these individuals were carriers of the APOL1 high-risk variants. The high-risk patients were more likely to experience sepsis and demonstrate kidney damage. But other organs commonly damaged by sepsis were not affected more in these individuals compared to the other 84% of patients who did not have these variants. Furthermore, when individuals with pre-existing kidney diseases were removed from this high-risk group, the increased likelihood of sepsis was no longer prominent. These findings suggest that the APOL1 variants do not directly increase the risk of sepsis, and this association is primarily due to patients with these genetic variations being more susceptible to kidney diseases. There are new drugs under development targeting the APOL1 variants. While these may provide protection against kidney diseases, they are unlikely to be successful at preventing or treating sepsis once a patient has been hospitalized with an infection.


Assuntos
Apolipoproteína L1 , Nefropatias , Sepse , Humanos , Apolipoproteína L1/genética , Genótipo , Estudos Retrospectivos , Sepse/complicações , Sepse/genética , Negro ou Afro-Americano
13.
Am J Hum Genet ; 110(11): 1950-1958, 2023 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-37883979

RESUMO

As large-scale genomic screening becomes increasingly prevalent, understanding the influence of actionable results on healthcare utilization is key to estimating the potential long-term clinical impact. The eMERGE network sequenced individuals for actionable genes in multiple genetic conditions and returned results to individuals, providers, and the electronic health record. Differences in recommended health services (laboratory, imaging, and procedural testing) delivered within 12 months of return were compared among individuals with pathogenic or likely pathogenic (P/LP) findings to matched individuals with negative findings before and after return of results. Of 16,218 adults, 477 unselected individuals were found to have a monogenic risk for arrhythmia (n = 95), breast cancer (n = 96), cardiomyopathy (n = 95), colorectal cancer (n = 105), or familial hypercholesterolemia (n = 86). Individuals with P/LP results more frequently received services after return (43.8%) compared to before return (25.6%) of results and compared to individuals with negative findings (24.9%; p < 0.0001). The annual cost of qualifying healthcare services increased from an average of $162 before return to $343 after return of results among the P/LP group (p < 0.0001); differences in the negative group were non-significant. The mean difference-in-differences was $149 (p < 0.0001), which describes the increased cost within the P/LP group corrected for cost changes in the negative group. When stratified by individual conditions, significant cost differences were observed for arrhythmia, breast cancer, and cardiomyopathy. In conclusion, less than half of individuals received billed health services after monogenic return, which modestly increased healthcare costs for payors in the year following return.


Assuntos
Neoplasias da Mama , Cardiomiopatias , Adulto , Humanos , Feminino , Estudos Prospectivos , Aceitação pelo Paciente de Cuidados de Saúde , Arritmias Cardíacas , Neoplasias da Mama/genética , Cardiomiopatias/genética
14.
medRxiv ; 2023 Aug 24.
Artigo em Inglês | MEDLINE | ID: mdl-37662278

RESUMO

Objective: Pediatric patients have different diseases and outcomes than adults; however, existing phecodes do not capture the distinctive pediatric spectrum of disease. We aim to develop specialized pediatric phecodes (Peds-Phecodes) to enable efficient, large-scale phenotypic analyses of pediatric patients. Materials and Methods: We adopted a hybrid data- and knowledge-driven approach leveraging electronic health records (EHRs) and genetic data from Vanderbilt University Medical Center to modify the most recent version of phecodes to better capture pediatric phenotypes. First, we compared the prevalence of patient diagnoses in pediatric and adult populations to identify disease phenotypes differentially affecting children and adults. We then used clinical domain knowledge to remove phecodes representing phenotypes unlikely to affect pediatric patients and create new phecodes for phenotypes relevant to the pediatric population. We further compared phenome-wide association study (PheWAS) outcomes replicating known pediatric genotype-phenotype associations between Peds-Phecodes and phecodes. Results: The Peds-Phecodes aggregate 15,533 ICD-9-CM codes and 82,949 ICD-10-CM codes into 2,051 distinct phecodes. Peds-Phecodes replicated more known pediatric genotype-phenotype associations than phecodes (248 versus 192 out of 687 SNPs, p<0.001). Discussion: We introduce Peds-Phecodes, a high-throughput EHR phenotyping tool tailored for use in pediatric populations. We successfully validated the Peds-Phecodes using genetic replication studies. Our findings also reveal the potential use of Peds-Phecodes in detecting novel genotype-phenotype associations for pediatric conditions. We expect that Peds-Phecodes will facilitate large-scale phenomic and genomic analyses in pediatric populations. Conclusion: Peds-Phecodes capture higher-quality pediatric phenotypes and deliver superior PheWAS outcomes compared to phecodes.

15.
medRxiv ; 2023 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-37662324

RESUMO

Polygenic variation unrelated to disease contributes to interindividual variation in baseline white blood cell (WBC) counts, but its clinical significance is undefined. We investigated the clinical consequences of a genetic predisposition toward lower WBC counts among 89,559 biobank participants from tertiary care centers using a polygenic score for WBC count (PGSWBC) comprising single nucleotide polymorphisms not associated with disease. A predisposition to lower WBC counts was associated with a decreased risk of identifying pathology on a bone marrow biopsy performed for a low WBC count (odds-ratio=0.55 per standard deviation increase in PGSWBC [95%CI, 0.30 - 0.94], p=0.04), an increased risk of leukopenia (a low WBC count) when treated with a chemotherapeutic (n=1,724, hazard ratio [HR]=0.78 [0.69 - 0.88], p=4.0×10-5) or immunosuppressant (n=354, HR=0.61 [0.38 - 0.99], p=0.04). A predisposition to benign lower WBC counts was associated with an increased risk of discontinuing azathioprine treatment (n=1,466, HR=0.62 [0.44 - 0.87], p=0.006). Collectively, these findings suggest that a WBC count polygenic score identifies individuals who are susceptible to escalations or alterations in clinical care that may be harmful or of little benefit.

16.
Nat Commun ; 14(1): 5562, 2023 09 09.
Artigo em Inglês | MEDLINE | ID: mdl-37689782

RESUMO

Genes act in concert with each other in specific contexts to perform their functions. Determining how these genes influence complex traits requires a mechanistic understanding of expression regulation across different conditions. It has been shown that this insight is critical for developing new therapies. Transcriptome-wide association studies have helped uncover the role of individual genes in disease-relevant mechanisms. However, modern models of the architecture of complex traits predict that gene-gene interactions play a crucial role in disease origin and progression. Here we introduce PhenoPLIER, a computational approach that maps gene-trait associations and pharmacological perturbation data into a common latent representation for a joint analysis. This representation is based on modules of genes with similar expression patterns across the same conditions. We observe that diseases are significantly associated with gene modules expressed in relevant cell types, and our approach is accurate in predicting known drug-disease pairs and inferring mechanisms of action. Furthermore, using a CRISPR screen to analyze lipid regulation, we find that functionally important players lack associations but are prioritized in trait-associated modules by PhenoPLIER. By incorporating groups of co-expressed genes, PhenoPLIER can contextualize genetic associations and reveal potential targets missed by single-gene strategies.


Assuntos
Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Epistasia Genética , Causalidade , Redes Reguladoras de Genes , Transcriptoma
17.
Front Pharmacol ; 14: 1257700, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37745051

RESUMO

Background: Alzheimer's disease (AD) is a debilitating neurodegenerative condition with few treatment options available. Drug repurposing studies have sought to identify existing drugs that could be repositioned to treat AD; however, the effectiveness of drug repurposing for AD remains unclear. This review systematically analyzes the progress made in drug repurposing for AD throughout the last decade, summarizing the suggested drug candidates and analyzing changes in the repurposing strategies used over time. We also examine the different types of data that have been leveraged to validate suggested drug repurposing candidates for AD, which to our knowledge has not been previous investigated, although this information may be especially useful in appraising the potential of suggested drug repurposing candidates. We ultimately hope to gain insight into the suggested drugs representing the most promising repurposing candidates for AD. Methods: We queried the PubMed database for AD drug repurposing studies published between 2012 and 2022. 124 articles were reviewed. We used RxNorm to standardize drug names across the reviewed studies, map drugs to their constituent ingredients, and identify prescribable drugs. We used the Anatomical Therapeutic Chemical (ATC) Classification System to group drugs. Results: 573 unique drugs were proposed for repurposing in AD over the last 10 years. These suggested repurposing candidates included drugs acting on the nervous system (17%), antineoplastic and immunomodulating agents (16%), and drugs acting on the cardiovascular system (12%). Clozapine, a second-generation antipsychotic medication, was the most frequently suggested repurposing candidate (N = 6). 61% (76/124) of the reviewed studies performed a validation, yet only 4% (5/124) used real-world data for validation. Conclusion: A large number of potential drug repurposing candidates for AD has accumulated over the last decade. However, among these drugs, no single drug has emerged as the top candidate, making it difficult to establish research priorities. Validation of drug repurposing hypotheses is inconsistently performed, and real-world data has been critically underutilized for validation. Given the urgent need for new AD therapies, the utility of real-world data in accelerating identification of high-priority candidates for AD repurposing warrants further investigation.

18.
medRxiv ; 2023 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-37577628

RESUMO

Black Americans have a significantly higher risk of developing chronic kidney disease (CKD), especially focal segmental glomerulosclerosis (FSGS), than European Americans. Two coding variants (G1 and G2) in the APOL1 gene play a major role in this disparity. While 13% of Black Americans carry the high-risk recessive genotypes, only a fraction of these individuals develops FSGS or kidney failure, indicating the involvement of additional disease modifiers. Here, we show that the presence of the APOL1 p.N264K missense variant, when co-inherited with the G2 APOL1 risk allele, substantially reduces the penetrance of the G1G2 and G2G2 high-risk genotypes by rendering these genotypes low-risk. These results align with prior functional evidence showing that the p.N264K variant reduces the toxicity of the APOL1 high-risk alleles. These findings have important implications for our understanding of the mechanisms of APOL1 -associated nephropathy, as well as for the clinical management of individuals with high-risk genotypes that include the G2 allele.

19.
BMC Rheumatol ; 7(1): 24, 2023 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-37550754

RESUMO

BACKGROUND: Antinuclear antibodies (ANA) are antibodies present in several autoimmune disorders. However, a large proportion of the general population (20%) also have a positive test; very few of these individuals will develop an autoimmune disease, and the clinical impact of a positive ANA in them is not known. Thus, we test the hypothesis that ANA + test reflects a state of immune dysregulation that alters risk for some clinical disorders in individuals without an autoimmune disease. METHODS: We performed high throughput association analyses in a case-control study using real world data from the de-identified electronic health record (EHR) system from Vanderbilt University Medical Center. The study population included individuals with an ANA titer ≥ 1:80 at any time (ANA +) and those with negative results (ANA-). The cohort was stratified into sub-cohorts of individuals with and without an autoimmune disease. A phenome-wide association study (PheWAS) adjusted by sex, year of birth, race, and length of follow-up was performed in the study cohort and in the sub-cohorts. As secondary analyses, only clinical diagnoses after ANA testing were included in the analyses. RESULTS: The cohort included 70,043 individuals: 49,546 without and 20,497 with an autoimmune disease, 26,579 were ANA + and 43,464 ANA-. In the study cohort and the sub-cohort with autoimmune disease, ANA + was associated (P ≤ 5 × 10-5) with 88 and 136 clinical diagnoses respectively, including lupus (OR ≥ 5.4, P ≤ 7.8 × 10-202) and other autoimmune diseases and complications. In the sub-cohort without autoimmune diseases, ANA + was associated with increased risk of Raynaud's syndrome (OR ≥ 2.1) and alveolar/perialveolar-related pneumopathies (OR ≥ 1.4) and decreased risk of hepatitis C, tobacco use disorders, mood disorders, convulsions, fever of unknown origin, and substance abuse disorders (OR ≤ 0.8). Analyses including only diagnoses after ANA testing yielded similar results. CONCLUSION: A positive ANA test, in addition to known associations with autoimmune diseases, Raynaud's phenomenon, and idiopathic fibrosing alveolitis related disorders, is associated with decreased prevalence of several non-autoimmune diseases.

20.
medRxiv ; 2023 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-37461512

RESUMO

Drug repurposing represents an attractive alternative to the costly and time-consuming process of new drug development, particularly for serious, widespread conditions with limited effective treatments, such as Alzheimer's disease (AD). Emerging generative artificial intelligence (GAI) technologies like ChatGPT offer the promise of expediting the review and summary of scientific knowledge. To examine the feasibility of using GAI for identifying drug repurposing candidates, we iteratively tasked ChatGPT with proposing the twenty most promising drugs for repurposing in AD, and tested the top ten for risk of incident AD in exposed and unexposed individuals over age 65 in two large clinical datasets: 1) Vanderbilt University Medical Center and 2) the All of Us Research Program. Among the candidates suggested by ChatGPT, metformin, simvastatin, and losartan were associated with lower AD risk in meta-analysis. These findings suggest GAI technologies can assimilate scientific insights from an extensive Internet-based search space, helping to prioritize drug repurposing candidates and facilitate the treatment of diseases.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...